First Classified Annotated Bibliography of NLP Tasks in the Burmese Language of Myanmar
Abstract
Natural Language Processing (NLP) has emerged as a field with a wide scope of research. The Burmese language, also called the Myanmar language, is a resource-scarce, tonal, analytic, syllable-timed, and principally monosyllabic language with Subject-Object-Verb (SOV) ordering. NLP for Burmese is further challenged by the fact that Burmese text has no white space marking word boundaries. With these facts in view, this paper is a first formal attempt to present a bibliography of research works pertinent to NLP tasks in the Burmese language. Rather than presenting a mere catalogue, the work annotates the research works and classifies them into NLP-related categories. The paper thus presents the state of the art of Burmese NLP tasks. Both the annotations and the classifications are useful to the scientific community because they show where research in Burmese NLP is heading. To the best of the author's knowledge, this is the first work of its kind worldwide for any language. Covering a period of more than 25 years, the paper discusses Burmese word identification, segmentation, disambiguation, collation, semantic parsing, and tokenization, followed by part-of-speech (POS) tagging, machine translation systems (MTS), text keying/input, recognition, and text display methods. A Burmese WordNet, a search engine, and the influence of other languages on Burmese are also discussed.
Similar Resources
Parsing Myanmar (Burmese) by Using Japanese as a Pivot
As Myanmar (Burmese) and Japanese share similar syntactic structures, we explore an approach for parsing Myanmar by using Japanese as a pivot. Specifically, we first apply a statistical machine translation (SMT) system to translate Myanmar sentences into Japanese, and then we parse the Japanese translations. Finally, the Japanese syntactic structures are mapped to the original Myanmar sentence...
Empirical Dependency-Based Head Finalization for Statistical Chinese-, English-, and French-to-Myanmar (Burmese) Machine Translation
We conduct dependency-based head finalization for statistical machine translation (SMT) for Myanmar (Burmese). Although Myanmar is an understudied language, linguistically it is a head-final language with similar syntax to Japanese and Korean. So, applying the efficient techniques of Japanese and Korean processing to Myanmar is a natural idea. Our approach is a combination of two approaches. Th...
Feasibility of Implementing of R2P in Myanmar
"Killing men, shooting them, slaughtering children, raping women, burning and looting houses": these are only some of the human rights crimes committed against Muslims in Myanmar, yet so far the international community has made no substantial contribution. In 2001, the International Law Commission realized that the UN Charter did not have much success in overcoming human rights violations,...
Function Tagging for Myanmar Language
Function tagging is one of the essential steps in a Myanmar-to-English machine translation system. In this paper we propose a set of function tags for Myanmar and address the question of assigning function tags to Myanmar words. A small, manually function-tagged corpus serves as the training data because a large-scale Myanmar corpus is unavailable at present. Part of the challenge of...
A New Method for Extracting Named Entities in Classical Arabic
In Natural Language Processing (NLP) studies, developing resources and tools contributes to the extension and effectiveness of research in each language. In recent years, Arabic Named Entity Recognition (ANER) has attracted the attention of NLP researchers because of its significant impact on improving other NLP tasks such as machine translation, information retrieval, question answering, query result...